System for downloading multimedia content and associated process

ABSTRACT

The invention relates to a system for downloading multimedia content via a mobile telephony network (10) to a terminal (50; 60, 70), comprising a voice recognition device (40), a database (30) connected to the network (10) and containing multimedia files. The terminal (50; 60, 70) transmits a voice request to the voice recognition device (40) and the voice recognition device (40) interprets the request that it receives. After the request has been received and interpreted by the voice recognition device (40), one or more interpretation prompt(s) designating one or more file(s) contained in the database (30) are sent to the terminal (50; 60, 70). The terminal is able to return a prompt selected, thereby bringing about the downloading of a file corresponding to the prompt selected from the database (30) to the terminal (50; 60, 70) via the mobile telephony network (10).

[0001] The invention relates to the field of browsing the Internet or any other network (GSM, GPRS, UMTS) regardless of the protocol used (WAP, I-Mode, etc.) by virtue of a mobile terminal or a computer.

[0002] Systems are known which make it possible to access Internet sites in which the user connects to a server that allows him to establish contact with other servers and to obtain information.

[0003] The document U.S. Pat. No. 6,101,473 describes a browsing system comprising a web server, a PC type computer terminal including a web browser and a voice recognition device coupling the server with a standard telephony network (of wire type). The voice recognition device is able to interpret a browsing voice command issued by the user from his telephone set linked to the standard telephony network and to control the web server as a function of this interpretation. The web server is able to return graphical data to the computer terminal, as a function of the voice command issued by the user. This browsing system allows a user to browse the Internet by formulating natural-language browsing or downloading orders from his fixed telephone.

[0004] By virtue of such systems, the user can download multimedia content to his computer terminal by verbally formulating his request from his fixed telephone set.

[0005] A drawback of these systems is that they are based on the parallel use of a computer terminal and a fixed telephone and that they comprise relatively complex means of operation.

[0006] Another drawback of these systems is that the voice recognition device does not always carry out correct interpretation of the user's requests. In particular, when the user is in a noisy environment, his voice may be distorted and the content of his request may be impaired thereby.

[0007] It follows that the multimedia content that he receives does not correspond to what he requested.

[0008] This drawback is particularly detrimental in the case where the user orders the downloading of a film, of a video or sound sequence (radio or television transmission), of an animation, of a program, etc.

[0009] Specifically, the downloading of the files may prove to be relatively lengthy and hence expensive for the user.

[0010] Moreover, the downloading of content may form part of a pay service.

[0011] This is why it is desirable for the user's requests to be correctly interpreted so as to avoid any futile downloading.

[0012] For this purpose, the invention proposes a system for downloading multimedia content to a terminal, characterized in that the downloading is carried out via a mobile telephony network, the said terminal being able to be connected to the mobile telephony network, the said system comprising a voice recognition device, a database connected to the network and containing multimedia files, the terminal being able to transmit a voice request emanating from the user to the voice recognition device and the voice recognition device is able to interpret the request that it receives and to return to the terminal one or more interpretation prompt(s) designating one or more file(s) contained in the database, the terminal being able to return a prompt selected by the user, thereby bringing about the downloading of a multimedia file corresponding to the prompt selected from the database to the terminal via the mobile telephony network.

[0013] This system advantageously allows the user to verify that his request has been correctly interpreted before confirming the downloading of a file. This system therefore avoids any futile downloading.

[0014] This system applies to terminals such as mobile telephones equipped with Internet browsers, computers linked to or incorporating a terminal for connection to the mobile network, electronic diaries, personal assistants, etc., able to exchange information via the mobile telephony network and to receive data files.

[0015] These terminals make it possible to browse the Internet, to download data, to use specific means of selection comprising for example a touch screen and a stylus.

[0016] By using such terminals, the user confirms his request in a simple and fast manner before the downloading of the corresponding multimedia content is performed.

[0017] In an implementation of the invention, the voice recognition device is able to generate and transmit to the terminal a list containing several most probable interpretation prompts.

[0018] The prompts can be transmitted to the terminal in the form of hypertext links tied with multimedia files contained in the database, the user being able to activate the link corresponding to his request.

[0019] Advantageously, the prompts being associated with probabilities of correspondence with the user's request, they may be ranked according to their order of probability.

[0020] This arrangement allows a further reduction in the time required by the user to choose the content that he desires to download.
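
As a purely illustrative sketch of the ranking mentioned in paragraphs [0019] and [0020], the following Python fragment orders interpretation prompts by their probability of correspondence. The names `Prompt` and `rank_prompts` and the `max_prompts` cut-off are hypothetical; the description does not specify how the list is built or how long it is.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Prompt:
    title: str          # title designating a file in the database
    probability: float  # probability of correspondence with the request


def rank_prompts(prompts: Iterable[Prompt], max_prompts: int = 5) -> List[Prompt]:
    """Return at most max_prompts prompts, most probable first.

    max_prompts is an assumed cut-off, not a value from the description.
    """
    ranked = sorted(prompts, key=lambda p: p.probability, reverse=True)
    return ranked[:max_prompts]
```

Presenting the most probable prompt first means the user usually finds the intended title at the top of the list, which is the time saving the paragraph above refers to.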

[0021] Advantageously, the system may comprise means for recording the voice request.

[0022] Advantageously, the terminal is a mobile terminal having a voice channel (via which analogue signals or digital data can travel) and/or a data channel.

[0023] In an implementation of the invention, the system comprises means for activating or deactivating the mode of operation with return of interpretation prompt(s) to the terminal and:

[0024] in the case where this mode of operation is activated, the voice recognition device is able to return one or more interpretation prompt(s) to the terminal,

[0025] in the case where this mode of operation is deactivated, the voice recognition device is able to transmit an interpretation directly to a server for access to the database.

[0026] Advantageously, the terminal comprises means for measuring a parameter relating to the quality of the network and as a function of this parameter, activating or deactivating the mode of operation with return of prompt(s).

[0027] Alternatively, the means for activating or deactivating the mode of operation with return of prompt(s) to the terminal can be actuated by a user of the terminal.

[0028] The invention also relates to a process for downloading multimedia content to a terminal, characterized in that the downloading is carried out via a mobile telephony network, the said terminal being able to be connected to the mobile telephony network, said process comprising the steps according to which:

[0029] a user transmits a signal corresponding to a verbal request to a voice recognition device from a terminal via the mobile telephony network,

[0030] the voice recognition device processes the signal and returns to the terminal one or more interpretation prompt(s) designating one or more multimedia file(s) contained in a database connected to the network,

[0031] the user selects the prompt corresponding to the verbal request, thereby bringing about the downloading of a multimedia file corresponding to the prompt selected from the database to the terminal via the mobile telephony network.

[0032] The voice request signal may be a voice or data signal.

[0033] In an implementation of the invention, the prompts are returned from the database to the terminal in the form of a text message.

[0034] In another implementation of the invention, the prompts are returned from the database to the terminal in the form of a voice message transmitted as a sound file or by audio streaming.

[0035] Advantageously, the prompts are presented by the terminal in a descending order of probability of correspondence with the request.

[0036] In an implementation of the invention, a prompt is selected by positioning a cursor over this prompt then by pressing an enable key of a keypad associated with the terminal.

[0037] In the case where the telephone is fitted with a touch screen (allowing entry of information by simply pressing or moving the finger on the screen), a prompt is selected by positioning a stylus on the touch screen at the level of the relevant prompt.

[0038] In another implementation of the invention, a prompt is selected by scrolling prompts down to the one whose selection is desired and then by pressing an enable key of a keypad associated with the terminal.

[0039] In yet another implementation of the invention, a prompt is selected by pressing a key of a keypad associated with the terminal identifying the prompt.

[0040] In yet another implementation, a prompt is selected by verbally pronouncing a reference identifying this prompt.

[0041] When none of the prompts is selected, the operation of processing the request by the voice recognition device is repeated while eliminating the unselected prompts from a list of expressions that the voice recognition device may comprise.

[0042] Having recorded the voice request beforehand, this new processing operation may be carried out on the basis of the initial recorded request.

[0043] Alternatively, this new processing operation may be carried out on a new request.

[0044] When none of the prompts is selected, the new request may be formulated in text or graphics mode.

[0045] In an implementation of the invention, a mode of operation with return of prompt(s) to the terminal is activated beforehand.

[0046] Other characteristics and advantages will emerge further from the description which follows, which is purely illustrative and nonlimiting and should be read in conjunction with FIG. 1 appended which is a diagrammatic representation of a downloading system in accordance with an embodiment of the invention.

[0047] In FIG. 1, the downloading system uses a mobile telephony network 10. This system comprises an access server 20 connected to the mobile telephony network 10. This access server 20 is also connected to a database 30 containing a collection of multimedia files and to a voice recognition and synthesis device 40.

[0048] Users can use the downloading system by means of a terminal able to exchange information via the mobile telephony network 10 and to receive data files. This may for example be a mobile telephone 50 equipped with a WEB or WAP Internet browser, or else a computer 60 linked to or incorporating a terminal 70 for connection to the mobile network 10. The terminal is in particular able to receive and interpret HTML pages. It is able to display hypertext and hypermedia links activatable by the user.

[0049] The terminal can also take the form of an electronic diary or personal assistant. These appliances generally comprise a touch screen and a stylus allowing the user to write on the screen or to select commands.

[0050] The user of a mobile telephone 50 searching for a multimedia content can connect up to the server 20 by browsing the Internet (using WAP, I-Mode or any other protocol) or by ordering direct access to this server 20. The mobile telephone 50 comprises a data channel and possibly a voice channel.

[0051] An HTML page is displayed on the screen of the mobile telephone 50 indicating to the user that he can search for a content by formulating a verbal request. This content may consist of a file containing a film, a video or sound sequence (radio or television transmission, music), an animation, a program, etc.

[0052] For example, if the user desires to download a film, he says the title of this film. His request is transmitted to the access server 20 in the form of a voice message or in the form of data packets. In the latter case, the mobile telephone 50 converts the voice signal of the user into data. The access server 20 records the request and transmits it to the voice recognition device 40. The voice recognition device 40 receives and interprets the user's request.

[0053] According to a first “without help” mode of operation, the voice recognition device 40 returns the most probable interpretation of the user's request to the access server 20. The access server 20 then orders the downloading of the file corresponding to the user's choice from the database 30 to the mobile telephone 50.

[0054] According to a second “with help” mode of operation, the voice recognition device 40 returns a collection of interpretation prompts interpreting the user's request to the access server 20. These prompts correspond to titles of films available in the database 30. Each of these prompts is associated with a probability of correspondence with the user's request.
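
The difference between the two modes of operation can be sketched as follows. This is a hedged illustration only: the function name `respond` and the representation of an interpretation as a `(title, probability)` pair are assumptions, not elements of the description.

```python
from typing import List, Tuple

Hypothesis = Tuple[str, float]  # (film title, probability of correspondence)


def respond(hypotheses: List[Hypothesis], with_help: bool) -> List[str]:
    """Return what the voice recognition device hands to the access server.

    "without help": only the most probable interpretation.
    "with help": the whole collection, most probable first.
    """
    ranked = sorted(hypotheses, key=lambda h: h[1], reverse=True)
    if not ranked:
        return []
    if with_help:
        return [title for title, _ in ranked]
    return [ranked[0][0]]
```

In the "without help" case the single returned title would lead directly to a download, whereas in the "with help" case the list is first submitted to the user for confirmation.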

[0055] According to a first variant of this system, the access server 20 transmits the collection of prompts to the mobile telephone 50 via the data channel in the form of a text message that is displayed on the screen of the telephone 50. These prompts are associated with probabilities of correspondence with the user's request and are displayed on the screen in descending order of probabilities.

[0056] The user of the terminal verifies that his request is indeed among the prompts displayed on his screen.

[0057] The user can select one of the prompts by moving a cursor onto the prompt in which he is interested or by designating the latter with a stylus or by scrolling the prompts and then enabling his choice by pressing an enable key of his keypad.

[0058] Alternatively, the prompts can be referenced by index numbers.

[0059] In this case, the user can select one of the prompts by typing in the index number of the prompt which interests him and by enabling his choice by pressing an enable key of his keypad.

[0060] He can also verbally pronounce the index number of the prompt which interests him.
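
Selection by index number, whether typed on the keypad or pronounced and then recognized as digits, might be resolved as in the sketch below. The function name `select_by_index` and the choice of 1-based numbering are assumptions made for illustration.

```python
from typing import Optional, Sequence


def select_by_index(prompts: Sequence[str], entry: str) -> Optional[str]:
    """Map a typed or recognized index number to the prompt it references.

    Index numbers are assumed to start at 1. Returns None when the entry
    is not a number or does not reference a displayed prompt.
    """
    try:
        index = int(entry.strip())
    except ValueError:
        return None
    if 1 <= index <= len(prompts):
        return prompts[index - 1]
    return None
```

Returning `None` for an invalid entry leaves the caller free to display the list again instead of starting a download by mistake.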

[0061] The mobile telephone 50 returns the selected prompt to the access server 20. The access server 20 orders the downloading of the file corresponding to the user's choice from the database 30 to the mobile telephone 50.

[0062] Advantageously, the prompts can be sent by the access server 20 to the mobile telephone 50 in the form of hypertext links tied with files of multimedia contents from the database 30. The user can activate the link which is of interest to him. The mobile telephone 50 is then directly tied to the database for downloading of the selected file.
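
One way of presenting prompts as hypertext links is sketched below. The base URL, the file identifiers and the function name `prompts_to_html` are placeholders invented for the example; the description does not define a URL scheme or page layout.

```python
from html import escape
from typing import Iterable, Tuple
from urllib.parse import quote


def prompts_to_html(
    prompts: Iterable[Tuple[str, str]],
    base_url: str = "http://example.invalid/files/",  # placeholder address
) -> str:
    """Render (title, file_id) pairs as an ordered list of hypertext links."""
    items = []
    for title, file_id in prompts:
        href = base_url + quote(file_id)  # percent-encode the identifier
        items.append(
            f'<li><a href="{escape(href, quote=True)}">{escape(title)}</a></li>'
        )
    return "<ol>\n" + "\n".join(items) + "\n</ol>"
```

An ordered list keeps the descending order of probability visible, and each link targets the file directly so that activating it is enough to start the download.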

[0063] This file can be downloaded in compressed form. In this case, this file may be read by decompression software (a content reader). Such software can for example be included in the mobile telephone 50.

[0064] The access server 20 records data relating to the downloading operation and the identity of the mobile telephone 50 in a register. This register will serve to bill the user for the downloading service. For example, this service may be invoiced directly on his telephone bill or deducted from his subscription.
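
A register of this kind could be sketched as follows. The fields recorded (`terminal_id`, `file_id`, `size_bytes`, `timestamp`) and the class names are assumptions; the description only states that data relating to the download and the identity of the telephone are stored for billing.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List, Optional


@dataclass(frozen=True)
class DownloadRecord:
    terminal_id: str     # identity of the terminal (assumed to be a string)
    file_id: str         # file that was downloaded
    size_bytes: int      # assumed billing-relevant datum
    timestamp: datetime  # when the download was ordered


class DownloadRegister:
    """In-memory stand-in for the register kept by the access server."""

    def __init__(self) -> None:
        self._records: List[DownloadRecord] = []

    def record(self, terminal_id: str, file_id: str, size_bytes: int,
               timestamp: Optional[datetime] = None) -> DownloadRecord:
        entry = DownloadRecord(
            terminal_id, file_id, size_bytes,
            timestamp or datetime.now(timezone.utc),
        )
        self._records.append(entry)
        return entry

    def downloads_for(self, terminal_id: str) -> List[DownloadRecord]:
        """Return the records to be billed to one terminal."""
        return [r for r in self._records if r.terminal_id == terminal_id]
```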

[0065] In the case where none of the prompts corresponds to his request, the user has the possibility of selecting the “none of these prompts” prompt. The mobile telephone 50 returns a cue to the access server 20, indicating that the user is not satisfied with the prompts proposed to him. The access server 20 again sends the user's request that it has recorded to the voice recognition device. The voice recognition device proceeds to a new recognition on the basis of this recording by eliminating the interpretation prompts that the user has not selected.

[0066] This elimination consists in eliminating from a list of expressions that the voice recognition device may comprise, the prompts not selected by the user.
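
The elimination step can be illustrated with the sketch below, in which the recognizer's output for the recorded request is modelled as a mapping from each expression to a score. That representation, the function name `recognize` and the `n_best` parameter are assumptions made for the example; a real recognizer would rescore the recorded audio rather than reuse a fixed table.

```python
from typing import Collection, List, Mapping


def recognize(scores: Mapping[str, float],
              excluded: Collection[str] = (),
              n_best: int = 3) -> List[str]:
    """Return the n_best expressions by score, skipping excluded ones.

    scores stands in for the recognizer's result on the recorded request.
    excluded holds the prompts the user has already declined.
    """
    candidates = {e: s for e, s in scores.items() if e not in excluded}
    return sorted(candidates, key=candidates.get, reverse=True)[:n_best]
```

A first pass is `recognize(scores)`. If the user chooses "none of these prompts", the second pass is `recognize(scores, excluded=first_pass)`, so the same titles are not proposed twice.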

[0067] According to a second variant of this system, the access server transmits the collection of prompts in the form of a voice message to the mobile telephone 50. This message may be transported on the voice channel or the data channel. In both cases, it may be downloaded to the mobile telephone in the form of a sound file or else transmitted by “audio streaming” (sending and reading in real time of compressed audio data). As previously, these prompts are pronounced from the most probable to the least probable. The user selects one of the prompts by pressing a key of the terminal corresponding to the prompt that he desires to select.

[0068] Alternatively, the various prompts may be pronounced in succession and the user accesses the next prompt or validates the prompt just pronounced by pressing certain keys of his keypad.

[0069] Alternatively again, the user selects a prompt by verbally pronouncing a reference identifying this prompt. The prompts can for example be associated with letters or with index numbers. The user then pronounces the index number or the letter of the prompt in which he is interested.

[0070] According to a third variant of this system, the access server 20 transmits the collection of prompts via the data channel in the form of data as well as an associated voice message. The prompts are displayed on the screen of the mobile telephone 50 while the mobile terminal 50 reads the voice message. The voice message can consist of the list of prompts displayed or any other indication for the attention of the user. It may be a message of the type: “Please select one of the prompts displayed by clicking on the one that corresponds to your request”. The voice message can be generated within the network (for example the voice recognition device 40 comprises a voice synthesis module) or by the telephone 50 itself.

[0071] In the three variants set forth above, the downloading system allows the user to verbally formulate his requests and to obtain a graphical reply simultaneously. The transmission and the displaying of prompts may be carried out even if no telephonic communication is established between the terminal 50 and the network 10. It follows that the voice channel is not permanently open.

[0072] When after several attempts, the voice recognition has not succeeded, the access server 20 prompts the user to enter his request using the keypad of the terminal 50 or to spell out the word or words corresponding to his request. In the case where the user spells out the word or words corresponding to his request, the voice recognition device 40 goes to alphanumeric recognition mode.
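
The fallback after repeated failures might be expressed as in this sketch. The threshold of three attempts and the mode labels are assumptions; the description speaks only of "several attempts".

```python
def next_input_mode(failed_attempts: int, max_voice_attempts: int = 3) -> str:
    """Choose how the user is asked to formulate the next request.

    max_voice_attempts is an assumed value; the text says only "several".
    """
    if failed_attempts < 0:
        raise ValueError("failed_attempts must not be negative")
    if failed_attempts < max_voice_attempts:
        return "voice"
    # Voice recognition has failed repeatedly: offer keypad entry or spelling.
    return "keypad_or_spelling"
```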

[0073] The mode of operation of the device “with help” is not necessarily activated permanently. In particular, in the case where the link between the terminal 50 and the network 10 is of good quality, the users' requests are generally transmitted and correctly interpreted by the voice recognition device 40. The mobile telephone 50 can comprise means for measuring a parameter relating to the quality of the terminal/network link and, as a function of the result of this measurement, for activating or deactivating the mode of operation “with help” of the downloading system.

[0074] This activation or deactivation of the mode of operation “with help” may also be carried out by the user himself.
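
The activation logic of the two preceding paragraphs can be sketched as below. The link quality is assumed here to be normalised between 0 (unusable) and 1 (excellent), and the threshold of 0.8 is arbitrary; neither the parameter measured nor any threshold is specified in the description.

```python
from typing import Optional


def with_help_enabled(link_quality: float,
                      user_override: Optional[bool] = None,
                      threshold: float = 0.8) -> bool:
    """Decide whether the "with help" mode should be active.

    link_quality: assumed normalised measurement in [0, 1].
    user_override: explicit choice by the user, which takes precedence.
    threshold: assumed quality above which requests are expected to be
    interpreted correctly without confirmation.
    """
    if user_override is not None:
        return user_override
    return link_quality < threshold
```

Giving the user's explicit choice precedence over the measurement is a design decision of this sketch; the description presents the two mechanisms without ranking them.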

[0075] Of course, the system described above may be implemented with a computer 60 instead of the mobile telephone 50. In this case, the computer 60 must be linked to or must incorporate a terminal 70 for connection to the mobile network 10 as well as to sound capture means.

[0076] The system may also be implemented with an electronic diary or a personal assistant that can be connected to a mobile telephony network.

[0077] Certain diaries or computers comprise a write recognition touch screen. If the user's request is poorly interpreted by the voice recognition device, the user can write his request on the touch screen.

[0078] It will be understood that the above-described content downloading system may be implemented with any equipment allowing access to a mobile telephony network.

1. System for downloading multimedia content to a terminal (50; 60, 70), characterized in that the downloading is carried out via a mobile telephony network (10), the said terminal (50; 60, 70) being able to be connected to the mobile telephony network (10), the said system comprising a voice recognition device (40), a database (30) connected to the network (10) and containing multimedia files, the terminal (50; 60, 70) being able to transmit a voice request emanating from the user to the voice recognition device (40) and the voice recognition device (40) is able to interpret the request that it receives and to return to the terminal (50; 60, 70) one or more interpretation prompt(s) designating one or more file(s) contained in the database (30), the terminal being able to return a prompt selected by the user, thereby bringing about the downloading of a multimedia file corresponding to the prompt selected from the database (30) to the terminal (50; 60, 70) via the mobile telephony network (10).
2. System according to claim 1, characterized in that the voice recognition device (40) is able to generate and transmit to the terminal (50; 60, 70) a list containing several most probable interpretation prompts.
3. System according to claim 2, characterized in that the prompts being associated with probabilities of correspondence with the user's request, the prompts of the list of prompts are ranked according to their order of probability.
4. System according to one of claims 1 to 3, characterized in that the prompts are transmitted to the terminal (50; 60, 70) in the form of hypertext links tied with multimedia files contained in the database (30), the user being able to activate the link corresponding to his request.
5. System according to one of the preceding claims, characterized in that it comprises means for recording the voice request.
6. System according to one of the preceding claims, characterized in that the terminal (50) is a mobile terminal having a voice channel and/or a data channel.
7. System according to one of the preceding claims, characterized in that the terminal (50) includes an Internet browser.
8. System according to one of the preceding claims, characterized in that it comprises means for activating or deactivating the mode of operation with return of interpretation prompt(s) to the terminal (50; 60, 70) and: in the case where this mode of operation is activated, the voice recognition device (40) is able to return one or more interpretation prompt(s) to the terminal (50; 60, 70), in the case where this mode of operation is deactivated, the voice recognition device is able to transmit an interpretation directly to a server (50) for access to the database.
9. System according to claim 8, characterized in that the terminal (50; 60, 70) comprises means for measuring a parameter relating to the quality of the network and as a function of this parameter, activating or deactivating the mode of operation with return of prompt(s).
10. System according to claim 8, characterized in that the means for activating or deactivating the mode of operation with return of prompt(s) to the terminal (50; 60, 70) can be actuated by a user of the terminal (50; 60, 70).
11. Process for downloading multimedia content to a terminal (50; 60, 70), characterized in that the downloading is carried out via a mobile telephony network (10), the said terminal being able to be connected to the mobile telephony network (10), said process comprising the steps according to which: a user transmits a signal corresponding to a verbal request to a voice recognition device (40) from a terminal (50; 60, 70) via the mobile telephony network (10), the voice recognition device (40) processes the signal and returns to the terminal (50; 60, 70) one or more interpretation prompt(s) designating one or more multimedia file(s) contained in a database (30) connected to the network (10), the user selects the prompt corresponding to the verbal request, thereby bringing about the downloading of a multimedia file corresponding to the prompt selected from the database (30) to the terminal (50; 60, 70) via the mobile telephony network (10).
12. Process according to claim 11, characterized in that the voice request signal is a voice or data signal.
13. Process according to one of claims 11 or 12, characterized in that the prompts are returned from the database (30) to the terminal (50; 60, 70) in the form of a text message.
14. Process according to one of claims 11 or 12, characterized in that the prompts are returned from the database (30) to the terminal (50; 60, 70) in the form of a voice message transmitted as a sound file or by audio streaming.
15. Process according to one of claims 11 to 14, characterized in that the prompts are presented by the terminal (50; 60, 70) in a descending order of probability of correspondence with the request.
16. Process according to claim 13, characterized in that a prompt is selected by positioning a cursor over this prompt then by pressing an enable key of a keypad associated with the terminal (50; 60, 70).
17. Process according to one of claims 11 to 15, characterized in that the user selects a prompt by scrolling prompts down to the one whose selection is desired and then by pressing an enable key of a keypad associated with the terminal (50; 60, 70).
18. Process according to one of claims 11 to 15, characterized in that the user selects a prompt by pressing a key of a keypad associated with the terminal (50; 60, 70) identifying this prompt.
19. Process according to one of claims 11 to 15, characterized in that the user selects a prompt by verbally pronouncing a reference identifying this prompt.
20. Process according to one of claims 11 to 15, characterized in that the user selects a prompt by positioning a stylus on a touch screen associated with the terminal, at the level of the relevant prompt.
21. Process according to one of claims 11 to 20, characterized in that when none of the prompts is selected, the operation of processing the request by the voice recognition device (40) is repeated while eliminating the unselected prompts from a list of expressions that the voice recognition device (40) may comprise.
22. Process according to claim 21, characterized in that having recorded the voice request beforehand, this new processing operation is carried out on the basis of the initial recorded request.
23. Process according to claim 21, characterized in that this new processing operation is carried out on a new request.
24. Process according to claim 23, characterized in that when none of the prompts is selected, the new request is formulated in text or graphics mode.
25. Process according to one of the preceding claims, characterized in that a mode of operation with return of prompt(s) to the terminal (50; 60, 70) is activated beforehand.